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Q ■ Abstract 

, Imperfect channel state information degrades the performance of muhiple-input muhiple-output 

(MIMO) communications; its effect on single-user (SU) and multi-user (MU) MIMO transmissions are 
quite different. In particular, MU-MIMO suffers from residual inter-user interference due to imperfect 
channel state information while SU-MIMO only suffers from a power loss. This paper compares the 
t/3 , throughput loss of both SU and MU MIMO on the downlink due to delay and channel quantization. 

Accurate closed-form approximations are derived for the achievable rates for both SU and MU MIMO. 
It is shown that SU-MIMO is relatively robust to delayed and quantized channel information, while MU- 
' MIMO with zero-forcing precoding loses spatial multiplexing gain with a fixed delay or fixed codebook 



(N 



size. Based on derived achievable rates, a mode switching algorithm is proposed that switches between 

SU and MU MIMO modes to improve the spectral efficiency, based on the average signal-to-noise ratio 

(SNR), the normalized Doppler frequency, and the channel quantization codebook size. The operating 
00 ' 

I regions for SU and MU modes with different delays and codebook sizes are determined, which can be 

used to select the preferred mode. It is shown that the MU mode is active only when the normalized 

•rH , 

. Doppler frequency is very small and the codebook size is large. 

Index Terms 

Multi-user MIMO, adaptive transmission, mode switching, imperfect channel state information at 
the transmitter (CSIT), zero-forcing precoding. 
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I. Introduction 

Over the last decade, the point-to-point multiple-input multiple-output (MIMO) link (SU- 
MIMO) has been extensively researched and has transited from a theoretical concept to a practical 
technique [1], [2]. Due to space and complexity constraints, however, current mobile terminals 
only have one or two antennas, which limits the performance of the SU-MIMO link. Multi-user 
MIMO (MU-MIMO) provides the opportunity to overcome such a limitation by communicating 
with multiple mobiles simultaneously. It effectively increases the number of equivalent spatial 
channels and provides spatial multiplexing gain proportional to the number of transmit antennas at 
the base station even with single-antenna mobiles. In addition, MU-MIMO has higher immunity 
to propagation limitations faced by SU-MIMO, such as channel rank loss and antenna correlation 
[3]. 

There are many technical challenges that must be overcome to exploit the full benefits of MU- 
MIMO. A major one is the requirement of channel state information at the transmitter (CSIT), 
which is difficult to get especially for the downlink/broadcast channel. For the MIMO downlink 
with Nt transmit antennas and Nj. receive antennas, with full CSIT the sum throughput can 
grow linearly with Nt even when Nr — 1, but without CSIT the spatial multiplexing gain is the 
same as for SU-MIMO, i.e. the throughput grows linearly with mm{Nt, Nj.) at high SNR [4]. 
Limited feedback is an efficient way to provide partial CSIT, which feeds back the quantized 
channel information to the transmitter via a low -rate feedback channel [5], [6]. However, such 
imperfect CSIT will greatly degrade the throughput gain provided by MU-MIMO [7], [8]. Besides 
quantization, there are other imperfections in the available CSIT, such as estimation error and 
feedback delay. With imperfect CSIT, it is not clear whether-or more to the point, when- MU- 
MIMO can outperform SU-MIMO. In this paper, we compare SU and MU-MIMO transmissions 
in the MIMO downlink with CSI delay and channel quantization, and propose to switch between 
SU and MU MIMO modes based on the achievable rate of each technique with practical receiver 
assumptions. 

A. Related Work 

For the MIMO downlink, CSIT is required to separate the spatial channels for different users. 
To obtain the full spatial multiplexing gain for the MU-MIMO system employing zero-forcing 
(ZF) or block-diagonalization (BD) precoding, it was shown in [7], [9] that the quantization 
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codebook size for limited feedback needs to increase linearly with SNR (in dB) and the number 
of transmit antennas. Zero-forcing dirty-paper coding and channel inversion systems with limited 
feedback were investigated in [8], where a sum rate ceiling due to a fixed codebook size was 
derived for both schemes. In [10], it was shown that to exploit multiuser diversity for ZF, both 
channel direction and information about signal-to-interference-plus-noise ratio (SINR) must be 
fed back. More recently, a comprehensive study of the MIMO downlink with ZF precoding was 
done in [11], which considered downlink training and explicit channel feedback and concluded 
that significant downlink throughput is achievable with efficient CSI feedback. For a compound 
MIMO broadcast channel, the information theoretic analysis in [12] showed that scaling the 
CSIT quality such that the CSIT error is dominated by the inverse of the SNR is both necessary 
and sufficient to achieve the full spatial multiplexing gain. 

Although previous studies show that the spatial multiplexing gain of MU-MIMO can be 
achieved with limited feedback, it requires the codebook size to increase with SNR and the 
number of transmit antennas. Even if such a requirement is satisfied, there is an inevitable rate 
loss due to quantization error, plus other CSIT imperfections such as estimation error and delay. 
In addition, most of prior work focused on the achievable spatial multiplexing gain, mainly 
based on the analysis of the rate loss due to imperfect CSIT, which is usually a loose bound [7], 
[9], [12]. Such analysis cannot accurately characterize the throughput loss, and no comparison 
with SU-MIMO has been made. In this paper, we derive good approximations for the achievable 
throughput for both SU and MU MIMO systems with fixed channel information accuracy, i.e. 
with a fixed delay and a fixed quantization codebook size. We are interested in the following 
question: With imperfect CSIT, including delay and channel quantization, when can MU-MIMO 
actually deliver a throughput gain over SU-MIMO? Based on this, we can select the one with 
the higher throughput as the transmission technique. 

B. Contributions 

In this paper, we investigate SU and MU-MIMO in the broadcast channel with CSI delay and 
limited feedback. The main contributions of this paper are as follows. 

• SU vs. MU Analysis. We investigate the impact of imperfect CSIT due to delay and channel 
quantization. We show that the SU mode is more robust to imperfect CSIT as it only suffers 
a constant rate loss, while MU-MIMO suffers more severely from the residual inter-user 
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interference. We characterize the residual interference due to delay and channel quantization, 
which shows these two effects are equivalent. Based on an independence approximation of 
the interference terms and the signal term, accurate closed-form approximations are derived 
for the ergodic rates for both SU and MU MIMO modes. 

• Mode Switching Algorithm. A SU/MU mode switching algorithm is proposed based on the 
ergodic sum rate as a function of the average SNR, normalized Doppler frequency, and the 
quantization codebook size. This transmission technique only requires a small number of users 
to feed back instantaneous channel information. The mode switching points can be calculated 
from the previously derived approximations for ergodic rates. 

• Operating Regions. The operating regions for SU and MU modes are determined, from which 
we can determine the active mode and find the condition that activates each mode. With a fixed 
delay and codebook size, if the MU mode is possible at all, there are two mode switching 
points, with the SU mode preferred at both low and high SNRs. The MU mode will only 
be activated when the normalized Doppler frequency is very small and the codebook size is 
large. From the numerical results, the minimum feedback bits per user to get the MU mode 
activated grow approximately linearly with the number of transmit antennas. 

The rest of the paper is organized as follows. The system model and some assumptions are 
presented in Section [III The transmission techniques for both SU and MU MIMO modes are 
described in Section [nil The rate analysis for both SU and MU modes and the mode switching 
are done in Section |lVl Numerical results and conclusions are in Section |V] and |Vll respectively. 

IL System Model 

We consider a MIMO downlink, where the transmitter (the base station) has Nt antennas and 
each mobile user has a single antenna. The system parameters are listed in Table HI During each 
transmission period, which is less than the channel coherence time and the channel is assumed 
to be constant, the base station transmits to one (SU-MIMO mode) or multiple (MU-MIMO 
mode) users. The discrete-time complex baseband received signal at the u-th user at time n is 



5 



given a! 



u 



yu[n] 



KM X] ^u'[n\xu>[n] + Zu[n] 



(1) 



u'=l 



where hu[?T,] is the A^^^ x 1 channel vector from the transmitter to the u-th user, and Zu[n\ is 
the normalized complex Gaussian noise vector, i.e. Zu[n\ ~ CA/'(0, 1). Xu[n] and f„[?2] are the 
transmit signal and A^t x 1 precoding vector for the w-th user, respectively. The transmit power 
constraint is E {x*[r;,]x[n]} = P, where x[n] = [x];,a;2, ■ ■ ■ As the noise is normalized, P 

is also the average transmit SNR. 

To assist the analysis, we assume that the channel hu[n] is well modeled as a spatially white 
Gaussian channel, with entries hi^j[n] ~ CA/'(0, 1), and the channels are i.i.d. over different users. 
The results will be different for different channel models. For example, a limited feedback system 
with line of sight MIMO channel requires fewer feedback bits compared to the Rayleigh channel 
[13]. The investigation of other channel models is left to future work. 

We consider two of the main sources of the CSIT imperfection-delay and quantization erroi^ 
specified as follows. 

A. CSI Delay Model 

We consider a stationary ergodic Gauss-Markov block fading process [14, Sec. 16-1], where 
the channel stays constant for a symbol duration and changes from symbol to symbol according 
to 



where e[n\ is the channel error vector, with i.i.d. entries ei[n] ~ CA/'(0, e^), and it is uncorrected 
with h[n— 1]. We assume the CSI delay is of one symbol. It is straightforward to extend the results 
to the scenario with a delay of multiple symbols. For the numerical analysis, the classical Clarke's 
isotropic scattering model will be used as an example, for which the correlation coefficient is 

'in this paper, we use uppercase boldface letters for matrices (X) and lowercase boldface for vectors (x). E[-] is the expectation 
operator. The conjugate transpose of a matrix X (vector x) is X* (x*). Similarly, X^ denotes the pseudo-inverse, x denotes 
the normalized vector of x, i.e. x = -jj^, and x denotes the quantized vector of x. 

^For a practical system, the feedback bits for each user is usually fixed, and there will inevitably be delay in the available 
CSI, both of which are difficult or even impossible to adjust. Other effects such as channel estimation error can be made small 
such as by increasing the transmit power or the number of pilot symbols. 



h.[n\ = ph[n — 1] + G[n] 



(2) 
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p = Jo{2nfdTs) with Doppler spread fd [15], where Jo(-) is the zero-th order Bessel function 
of the first kind. The variance of the error vector is = 1 — p^. Therefore, both p and Ce are 
determined by the normalized Doppler frequency fdTg. 

The channel in ^ is widely-used to model the time-varying channel. For example, it is used to 
investigate the impact of feedback delay on the performance of closed-loop transmit diversity in 
[16] and the system capacity and bit error rate of point-to-point MIMO link in [17]. It simplifies 
the analysis, and the results can be easily extended to other scenarios. Essentially, this model is 
of the form 

h[n] = g[n] + e[n], (3) 

where g[n] is the available CSI at time n with an uncorrected error vector e[n], g[n\ ~ 
CJ\f{0, (1 — 6^)1), and e[n] ~ CJ\f{0, ep). It can be used to consider the effect of other imperfect 
CSIT, such as estimation error and analog feedback. The difference is in e[n], which has different 
variance el for different scenarios. Some examples are given as follows. 

a) Estimation Error: If the receiver obtains the CSI through MMSE estimation from Tp 
pilot symbols, the error variance is el = Yipr^' where 7p is the SNR of the pilot symbol [18]. 

b) Analog Feedback: For analog feedback, the error variance is ej = -r—^ — , where r„/ is 
the number of channel uses per channel coefficient and 'jui is the SNR on the uplink feedback 
channel [19]. 

c) Analog Feedback with Prediction: As shown in [20], for analog feedback with a d- 
step MMSE predictor and the Gauss-Markov model, the error variance is el = p^'^e^ + (1 — 
P^) YliZl P^'' where p is the same as in Q and cq is the Kalman filtering mean-square error. 

Therefore, the results in this paper can be easily extended to these systems. In the following 
parts, we focus on the effect of CSI delay. 

B. Channel Quantization Model 

We consider frequency-division duplexing (FDD) systems, where limited feedback techniques 
provide partial CSIT through a dedicated feedback channel from the receiver to the transmitter. 
The channel direction information for the precoder design is fed back using a quantization 
codebook known at both the transmitter and receiver. 

The quantization is chosen from a codebook of unit norm vectors of size L = 2^ . We assume 
each user uses a different codebook to avoid the same quantization vector. The codebook for 
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user u is Cu = {Cu,i, c„,2, ■ ■ ■ , Cu,l}- Each user quantizes its channel to the closest codeword, 
where closeness is measured by the inner product. Therefore, the index of channel for user u is 

4 = arg max |h*c„,£|. (4) 
i<e<L 

Each user needs to feed back B bits to denote this index, and the transmitter has the quantized 
channel information = Cuju- As the optimal vector quantizer for this problem is not known 
in general, random vector quantization (RVQ) [21] is used, where each quantization vector is 
independently chosen from the isotropic distribution on the A^t -dimensional unit sphere. It has 
been shown in [7] that RVQ can facilitate the analysis and provide performance close to the 
optimal quantization. In this paper, we analyze the achievable rate averaged over both RVQ-based 
random codebooks and fading distributions. 

An important metric for the limited feedback system is the squared angular distortion, defined 
as sin^ (0^) = 1 - IKKI^' where 6l„ = Z (h^, h„^. With RVQ, it was shown in [7], [22] that 
the expectation in i.i.d. Rayleigh fading is given by 

E, [sin^ {6^)] ^2^.(3 (2^, -j^^ , (5) 

where /3{-) is the beta function. It can be tightly bounded as [7] 

Nt-1 



2~~^ < E [sin^ {Ou)] < 2~~K (6) 



III. Transmission Techniques 

In this section, we describe the transmission techniques for both SU and MU MIMO systems 
with perfect CSIT, which will be used in the subsequent sections for imperfect CSIT systems. 
By doing this, we focus on the impacts of imperfect CSIT on the conventional transmission 
techniques. Designing imperfect CSIT-aware precoders is left to future work. Throughout this 
paper, we use the achievable ergodic rate as the performance metric for both SU and MU-MIMO 
systems. The base station transmits to a single user {U = 1) for the SU-MIMO system and to 
Nt users {U = Nt) for the MU-MIMO system. The SU/MU mode switching algorithm is also 
described. 
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A. SU-MIMO System 

With perfect CSIT, it is optimal for the SU-MIMO system to transmit along the channel 
direction [1], i.e. selecting the beamforming (BF) vector as f[n] = h[n], denoted as eigen- 
beamforming in this paper. The ergodic capacity of this system is the same as that of a maximal 
ratio combining diversity system, given by [23] 

Rbf{P) = Eh [log, (1 + P||h[n]f )] = \og,{e)e'/^ ^^~pl^^\ (7) 

fc=0 

where r(-, ■) is the complementary incomplete gamma function defined as r(a, x) = t°'~^e~^dt 

B. MU-MIMO System 

For MIMO broadcast channels, although dirty-paper coding (DPC) [24] is optimal [25]-[29], 
it is difficult to implement in practice. As in [7], [11], ZF precoding is used in this paper, 
which is a linear precoding technique that precancels inter-user interference at the transmitter. 
There are several reasons for us to use this simple transmission technique. Firstly, due to its 
simple structure, it is possible to derive closed-form results, which can provide helpful insights. 
Second, the ZF precoding is able to provide full spatial multiplexing gain and only has a power 
offset compared to the optimal DPC system [30]. In addition, it was shown in [30] that the 
ZF precoding is optimal among the set of all linear precoders at asymptotically high SNR. In 
Section |Vl we will show that our results for the ZF system also apply for the regularized ZF 
precoding [31], which provides a higher throughput than the ZF precoding at low to moderate 
SNRs. 

With precoding vectors fu[n], m = 1, 2, ■ ■ ■ , [/, assuming equal power allocatioj^, 
SINR for the w-th user is given as 



the received 



1ZF,u 



This is true for a general linear precoding MU-MIMO system. With perfect CSIT, this quantity 
can be calculated at the transmitter, while with imperfect CSIT, it can be estimated at the receiver 
and fed back to the transmitter given knowledge of [n] . 



^At high SNR, this performs closely to the system employing optimal water-filling, as power allocation mainly benefits at 
low SNR. 
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Denote tl[n] = [hi [n] , h2 [n] , ■ ■ ■ ,h[7[n]]*. With perfect CSIT, the ZF preceding vectors are 
determined from the pseudo-inverse of H[n], as F[n] = H^[n] = H*[?i](H[?2]H*[n])^^ The 
precoding vector for the u-th user is obtained by normalizing the u-th column of F[n]. Therefore, 
h*[n]fii/[n] = 0, Vu 7^ u', i.e. there is no inter-user interference. The received SINR for the u-th 
user becomes 

jzF,u=^\K[n]W\'. (8) 

As f„[n] is independent of h„[r;,], and ||fu[r?,]|p = 1, the effective channel for the u-th user is a 
single-input single-output (SISO) Rayleigh fading channel. Therefore, the achievable sum rate 
for the ZF system is given by 

u 

Rzf{P) =J2^y [log2(l + lZF,u)] . (9) 
u=l 

Each term on the right hand side of ^ is the ergodic capacity of a SISO system in Rayleigh 
fading, given in [23] as 

RzF,u = [log2(l + JzF,u)] = log2(e)e^/^Ei(f//P), (10) 
where Ei{-) is the exponential-integral function of the first order, Ei{x) = ^-^dt. 

C. SU/MU Mode Switching 

Imperfect CSIT will degrade the performance of the MIMO communication. In this case, it is 
unclear whether and when the MU-MIMO system can actually provide a throughput gain over 
the SU-MIMO system. Based on the analysis of the achievable ergodic rates in this paper, we 
propose to switch between SU and MU modes and select the one with the higher achievable 
rate. 

The channel correlation coefficient p, which captures the CSI delay effect, usually varies 
slowly. The quantization codebook size is normally fixed for a given system. Therefore, it is 
reasonable to assume that the transmitter has knowledge of both delay and channel quantization, 
and can estimate the achievable ergodic rates of both SU and MU MIMO modes. Then it can 
determine the active mode and select one (SU mode) or Nt (MU mode) users to serve. This is a 
low-complexity transmission strategy, and can be combined with random user selection, round- 
robin scheduling, or scheduling based on queue length rather than channel status. It only requires 
the selected users to feed back instantaneous channel information. Therefore, it is suitable for a 
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system that has a constraint on the total feedback bits and only allows a small number of users 
to send feedback, or a system with a strict delay constraint that cannot employ opportunistic 
scheduling based on instantaneous channel information. 

To determine the transmission rate, the transmitter sends pilot symbols, from which the active 
users estimate the received SINRs and feed back them to the transmitter. In this paper, we 
assume the transmitter knows perfectly the actual received SINR at each active user. In practice, 
there will inevitably be errors in such information due to estimation error and feedback delay, 
which will result in rate mismatch, i.e. the transmission rate based on the estimated SINR does 
not match the actual SINR on the channel, so there will be outage events. How to deal with 
such rate mismatch is of practical importance, and we mention several possible approaches 
as follows. The full investigation of this issue requires further research and is out of scope 
of this paper. Considering the outage events, the transmission strategy can be designed based 
on the actual information symbols successfully delivered to the receiver, denoted as goodput 
in [32], [33]. With the estimated SINR, another approach is to back off on the transmission 
rate based on the variance of the estimation error, as did in [34], [35] for the single-antenna 
opportunistic scheduling system and in [36] for the multiple-antenna opportunistic beamforming 
system. Combined with user selection, the transmission rate can also be determined based on 
some lower bound of the actual SINR to make sure that no outage occurs, as did in [37] for the 
limited feedback system. 

IV. Performance Analysis and Mode Switching 

In this section, we investigate the achievable ergodic rates for both SU and MU MIMO modes. 
We first analyze the average received SNR for the BF system and the average residual interference 
for the ZF system, which provide insights on the impact of imperfect CSIT. To select the active 
mode, accurate closed-form approximations for both SU and MU modes are then derived. 

A. SU Mode-Eigen-Beamforming 

First, if there is no delay and only channel quantization, the BF vector is based on the quantized 
feedback, i^^\n] = h[n]. The average received SNR is 

SNRg^ = Eh,c[P|h*[n]h[n]|2] 
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Eh,c[P||hMf |h*NhM|2 
Nt-1 



(a) 

<PNt[l 



N, 



-2 



(11) 



where (a) follows the independence between ||h[n]|p and |h*[n]h[n]p, together with the result 
in ©. 

With both delay and channel quantization, the BF vector is based on the quantized channel 
direction with delay, i.e. i^^'^^ln] = h[n — l]. The instantaneous received SNR for the BF system 



SNR 



iQD) 
BF 



P 



n\ 



(12) 



Based on (fTT)) . we get the following theorem on the average received SNR for the SU mode. 
Theorem 1: The average received SNR for a BF system with channel quantization and CSI 
delay is 

SNRSf ^ < PN, (p^Ag? + Aig) , (13) 

where A^ and A^ show the impact of channel quantization and feedback delay, respectively, 
given by 



'■BF 



N't 



Proof: See Appendix |Bl ■ 
From Jensen's inequality, an upper bound of the achievable rate for the BF system with both 
quantization and delay is given by 



R 



(QD) 
BF 



< log2 

< logs 



log2 ( 1 + SNRgf 



l + SNR^r 



l + PiV,(p2AW) + A^7 



AD) 



(14) 



Remark 1: Note that = 1 — e^, so the average SNR decreases with e^. With a fixed B 
and fixed delay, the SNR degradation is a constant factor independent of P. At high SNR, the 
imperfect CSIT introduces a constant rate loss logs {^P^^^bf + ^bf)- 

The upper bound provided by Jensen's inequality is not tight. To get a better approximation 
for the achievable rate, we first make the following approximation on the instantaneous received 
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SNR 

SNRgf^ = P|h*[n]h[n-l]p 

= P\{ph[n - 1] + e[n])*h[n - Ijj^ 

RiPp>*[r^-l]h[r^-l]|^ (15) 

i.e. we remove the term with e[n] as it is normally insignificant compared to ph[n — 1]. This will 
be verified later by simulation. In this way, the system is approximated as the one with limited 
feedback and with equivalent SNR p^P. 

From [22], the achievable rate of the limited feedback BF system is given by 

\ A;=0 

- £ (1 - (1 - .r'-r' ^^"•"E..,. {£) <Lr) . (16) 

where En{x) — e~^^x~^dt is the n-th order exponential integral. So R^^p^ can be approxi- 
mated as 

R^S^\p)^R^Skp'P)- (IV) 

As a special case, considering a system with delay only, e.g. the time-division duplexing 
(TDD) system which can estimate the CSI from the uplink with channel reciprocity but with 
propagation and processing delay, the BF vector is based on the delayed channel direction, i.e. 
f^^)[n] = h[n — 1]. We provide a good approximation for the achievable rate for such a system 
as follows. 

The instantaneous received SNR is given as 

SNr£^ =P|h*[n]f(^)[n]|2 

= P\{ph[n - 1] + e[n])*h[n - 1]!^ 

S Pplh[n - 1]||' + P\e*[n]h.[n - 1]|^ (18) 

In step (a) we eliminate the cross terms since e[n] is normally small. As e[n] is independent of 
h[n - 1], e[n] ~ CA/'(0, 6^1) and ||h[n - 1]||^ = 1, we have |e*[n]h[n - 1]^ ~ where Xm 
denotes chi-square distribution with M degrees of freedom. In addition, ||h[n — 1]||^ ~ X2Art> 
and it is independent of |e*[n]h[n — Then the following theorem can be derived. 
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Theorem 2: The achievable ergodic rate of the BF system with delay can be approximated as 



4f ~ log2 (6)00^*6^/"^ El 



1 



Nt~l i Nt-l- 
Ctn 



.V2^ 

- iog2 (e)(i -ao)Y.Y. T^rnT^i^^^'^^^^V^i' - (1^) 

i=0 1=0 ^ 

where r]i = Pp^, r]2 = P^l, cto = r]2-m ' ^'^^ ' given in (l36l) in Appendix lAl 

Proof: See Appendix O ■ 

5. Zero-Forcing 

1) Average Residual Interference: If there is no delay and only channel quantization, the 
precoding vectors for the ZF system are designed based on hi[n], h2[n], ■ ■ ■ , h.u[n\ to achieve 
h* [njf^?'' [n] = 0, Vm 7^ u' . With random vector quantization, it is shown in [7] that the average 
noise plus interference for each user is 



U 



1 + 2 ^P. (20) 



With both channel quantization and CSI delay, precoding vectors are designed based on hi [n — 
1], h2[n - 1], ■ ■ ■ , h[/[r2 - 1] and achieve h;[n- l]f^?^^[n] = 0, Vu 7^ u' . The received SINR for 
the M-th user is given as 



As £^'^^''[72] is in the nuUspace of h„'[n — 1] Vm' 7^ u, it is isotropically distributed in C^' and 
independent of h„[n — 1] as well as h„[n], so |h* [njfl'^^'' [n] p ~ xi- The average noise plus 
interference is given in the following theorem. 

Theorem 3: The average noise plus interference for the n-th user of the ZF system with both 
channel quantization and CSI delay is 

Ag?' = 1 + (C/ - 1)^ (pSA<«„ + A™„) , (22) 

where and f^^Fu degradations brought by channel quantization and feedback 

delay, respectively, given by 

^(Q) _ ^ A^"") -e^ 
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Proof: The proof is similar to the one for Theorem [7] in appendix |Bl ■ 
Remark 2: From Theorem \3\ we see that the average residual interference for a given user 
consists of three parts: 

(i) The number ofinterferers, U — 1. The more users the system supports, the higher the mutual 
interference. 

(ii) The transmit power of the other active users, jj. As the transmit power increases, the 
system becomes interference-limited. It is possible to improve performance through power 
allocation, which is left to future work. 

(iii) The CSIT accuracy for this user, which is reflected from p^A^^ + A^^^. The user with 
a larger delay or a smaller codebook size suffers a higher residual interference. 

From this remark, the interference term, jj{U — l)eg„, equivalently comes from U — \ virtual 
interfering users, each with equivalent SNR as ^ ^p^A^^ + A^^^j . With a high P and a 
fixed ee,u or B, the system is interference-limited and cannot achieve full spatial multiplexing 
gain. Therefore, to keep a constant rate loss, i.e. to sustain the spatial multiplexing gain, the 
channel error due to both quantization and delay needs to be reduced as SNR increases. Similar 
to the result for the limited feedback system in [7], for the ZF system with both delay and 
channel quantization, we can get the following corollary for the condition to achieve the full 
spatial multiplexing gain. 

Corollary 1: To keep a constant rate loss of log2(5o bps/Hz for each user, the codebook size 
and CSI delay need to satisfy the following condition 

2 . (Q) , .(D) _ U _ ^0-1 .23^ 

Proof: As shown in [7], [11], the rate loss for each user due to imperfect CSIT is upper 
bounded by ARu < log2 A^^^. The corollary follows from solving logg A^^^ = log2 5o- ■ 
Equivalently, this means that for a given p^, the feedback bits per user needs to scale as 

As 1, i.e. there is no CSI delay, the condition becomes B = {Nt — 1) log2 s—j^ which 

agrees with the result in [7] with limited feedback only. 
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2) Achievable Rate: For the ZF system with imperfect CSI, the genie-aided upper bound for 



the ergodic achievable rateQ is given by [11] 

u 



RfpZ- (25) 



u=l 



We assume the mobile users can perfectly estimate the noise and interference and feed back it 
to the transmitter, so the upper bound is chosen as the performance metric, i.e. R^^j!^^ = R^zplb^ 
as in [7], [8], [10]. 

The following lower bound based on the rate loss analysis is used in [7], [11] 

R%^'>RzF-j:^\og,^fZ (26) 

u=l 

where Rzf is the achievable rate with perfect CSIT, given in dH). However, this lower bound is 
very loose. In the following, we will derive a more accurate approximation for the achievable 
rate for the ZF system. 

To get a good approximation for the achievable rate for the ZF system, we first approximate 
the instantaneous SINR as 



(QD) 



p 
u 




l + (7 \l2u'j^u Pu 


h*[n - 







(27) 

i.e. eliminating the interference terms which have both hu[n — 1] and e^fn] as eu[n] is normally 
very small, so we get two separate interference sums due to delay and quantization, respectively. 

For the interference term due to delay, |e*[n]f^?^^[n]p ~ xh ^[^] independent of 
{^^\n] and ||fu?^''H||^ = 1- For the interference term due to quantization, it was shown in 
[7] that |h* [ra — l]f^?^''[r2]p is equivalent to the product of the quantization error sin^ 6u and an 
independent /?(!, Nt — 2) random variable. Therefore, we have 

\K[n - l]ii?''\n] = \\K[n - l]f (sin^^,) ■ /3(l,iV, - 2). (28) 



''This upper bound is acliievable only wlien a genie provides users with perfect knowledge of all interference and the transmitter 
knows perfectly the received SINR at each user. 
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In [10], with a quantization cell approximatioij^ [38], [39], it was shown that ||h„[n— 1] |p(sin^ 6^) 

B 

has a Gamma distribution with parameters (iVt — l,S), where 6 = 2 ^t-^. As shown in [10] 
the analysis based on the quantization cell approximation is close to the performance of random 
vector quantization, so we use this approach to derive the achievable rate. 

The following lemma gives the distribution of the interference term due to quantization. 

Lemma 1: Based on the quantization cell approximation, the interference term due to quan- 
tization in (|27]) . \hu[n — l]f^?^^[n]p, is an exponential random variable with mean 6, i.e. its 
probability distribution function (pdf) is 

= ie-^/^x > 0. (29) 



Proof: See Appendix |Dl ■ 
Remark 3: From this lemma, we see that the residual interference terms due to both delay 
and quantization are exponential random variables, which means the delay and quantization error 
have equivalent effects, only with different means. By comparing the means of these two terms, 
i.e. comparing and 2 ^^t-i, we can find the dominant one. In addition, with this result, we 
can approximate the achievable rate of the ZF limited feedback system, which will be provided 
later in this section. 

Based on the distribution of the interference terms, the approximation for the achievable rate 
for the MU mode is given in the following theorem. 

Theorem 4: The ergodic achievable rate for the u-th user in the MU mode with both delay 
and channel quantization can be approximated as 



A/-1 2 



i=0 j=l 



(1) o.^ .(2) 



(30) 



where a = P = ^, 6i = pl6, 62 = e^ „, M = Nt - 1, al ' and a\ ' are given in (gH) and (|45l) , 
and /3(-, - , ■) is given in (1381) in Appendix [Al 

Proof: See Appendix El ■ 
The ergodic sum throughput is 

u=l 



^The quantization cell approximation is based on the ideal assumption that each quantization cell is a Voronoi region on a 
spherical cap with the surface area 2^^ of the total area of the unit sphere for a B bits codebook. The detail can be found in 
[10], [38], [39]. 
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As a special case, for a ZF system with delay only, we can get the following approximation 
for the ergodic achievable rate. 

Corollary 2: The ergodic achievable rate for the n-th user in the ZF system with delay is 
approximated as 

/ N -{A/-1) / , X 

4%«log.(e)(^) .,3(^_,^,M-l), ,32) 

where a = jj, (3 = M = Nt - I, and /3(-, -, ■) is given in ([38]) in Appendix |Al 

Proof: Following the same steps in Appendix |E] with 5i = 0. ■ 
Remark 4: As shown in Lemma C] the effects of delay and channel quantization are equivalent, 
so the approximation in (l32l) also applies for the limited feedback system. This is verified by 
simulation in Fig. [H which shows that this approximation is very accurate and can be used to 
analyze the limited feedback system. 

C. Mode Switching 

We first verify the approximation (|30l) in Fig. [2l which compares the approximation with 
simulation results and the lower bound (l26l) . with i? = 10, f = 20 km/hr, /c = 2 GHz, and 
Tg = 1 msec. We see that the lower bound is very loose, while the approximation is accurate 
especially for A^^ = 2. In fact, the approximation turns out to be a lower bound. Note that due 
to the imperfect CSIT, the sum rate reduces with Nf. 

In Fig. [3l we compare the BF and ZF systems, with B = 18, fc = 2 GHz, = 10 km/hr, and 
Ts = I msec. We see that the approximation for the BF system almost matches the simulation 
exactly. The approximation for the ZF system is accurate at low to medium SNRs, and becomes 
a lower bound at high SNR, which is approximately 0.7 bps/Hz in total, or 0.175 bps/Hz per 
user, lower than the simulation. The throughput of the ZF system is limited by the residual 
inter-user interference at high SNR, where it is lower than the BF system. This motivates to 
switch between the SU and MU MIMO modes. The approximations (flTI) and (|30l) will be used 
to calculate the mode switching points. There may be two switching points for the system with 
delay, as the SU mode will be selected at both low and high SNR. These two points can be 
calculated by providing different initial values to the nonlinear equation solver, such as fsolve 
in MATLAB. 
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V. Numerical Results 

In this section, numerical results are presented. First, the operating regions for different modes 
are plotted, which show the impact of different parameters, including the normalized Doppler 
frequency, the codebook size, and the number of transmit antennas. Then the extension of our 
results for the ZF precoding to the MMSE precoding is demonstrated. 

A. Operating Regions 

As shown in Section ITV-Ci finding mode switching points requires solving a nonlinear equation, 
which does not have a closed-form solution and gives little insight. However, it is easy to evaluate 
numerically for different parameters, from which insights can be drawn. In this section, with 
the calculated mode switching points for different parameters, we plot the operating regions for 
both SU and MU modes. The active mode for the given parameter and the condition to activate 
each mode can be found from such plots. 

In Fig. m the operating regions for both SU and MU modes are plotted, for different normalized 
Doppler frequencies and different number of feedback bits in Fig. |4(a)| and Fig. |4(b)| , respectively. 



and with U = Nt = A. There are analogies between two plots. Some key observations are as 
follows: 

(i) For the delay plot Fig. |4(a)[ comparing the two curves for B = \Q and B = 20, we see 



that the smaller the codebook size, the smaller the operating region for the ZF mode. For 
the ZF mode to be active, fiiTs needs to be small, specifically we need /^T^ < 0.055 
and fdTs < 0.046 for B = 20 and B = 16, respectively. These conditions are not easily 
satisfied in practical systems. For example, with carrier frequency /c = 2 GHz, mobility 
t> = 20 km/hr, the Doppler frequency is 37 Hz, and then to satisfy f^Tg < 0.055 the delay 
should be less than 1.5 msec. 

(ii) For the codebook size plot Fig. |4(b)[ comparing the two curves with v = 10 km/hr and 
V = 20 km/hr, as fdTs increases (v increases), the ZF operating region shrinks. For the ZF 
mode to be active, we should have B > 12 and B > lA for t> = 10 km/hr and v = 20 
km/hr, respectively, which means a large codebook size. Note that for BF we only need a 
small codebook size to get the near-optimal performance [5]. 

(iii) For a given fdTs and B, the SU mode will be active at both low and high SNRs, which is 
due to its array gain and the robustness to imperfect CSIT, respectively. 
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The operating regions for different Nt are shown in Fig. [51 We see that as A^^^ increases, the 
operating region for the MU mode shrinks. Specifically, we need B > 12 for Nt = 4:, B > 19 
for Nt = 6, and B > 26 for Nt = 8 to get the MU mode activated. Note that the minimum 
required feedback bits per user for the MU mode grow approximately linearly with Nt. 

B. ZF vs. MMSE Precoding 

It is shown in [31] that the regularized ZF precoding, denoted as MMSE precoding in this 
paper, can significantly increase the throughput at low SNR. In this section, we show that our 
results on mode switching with ZF precoding can also be applied to MMSE precoding. 



From this, we see that the MMSE precoders converge to ZF precoders at high SNR. Therefore, 
our derivations for the ZF system also apply to the MMSE system at high SNR. 



In Fig. |6l we compare the performance of ZF and MMSE precoding systems with dela>|j. We 
see that the MMSE precoding outperforms ZF at low to medium SNRs, and converges to ZF at 
high SNR while converges to BE at low SNR. In addition, it has the same rate ceiling as the 
ZF system, and crosses the BF curve roughly at the same point, after which we need to switch 
to the SU mode. Based on this, we can use the second predicted mode switching point (the one 
at higher SNR) of the ZF system for the MMSE system. We compare the simulation results 
and calculation results by (fT9l) and (|32l) for the mode switching points in Table [III For the ZF 
system, it is the second switching point; for the MMSE system, it is the only switching point. 
We see that the switching points for MMSE and ZF systems are very close, and the calculated 
ones are roughly 2.5 ~ 3 dB lower. 

VI. Conclusions 

In this paper, we compare the SU and MU MIMO transmissions in the broadcast channel with 
delayed and quantized CSIT, where the amount of delay and the number of feedback bits per 



Denote H[?2] = hi[n], h2[n], ■ ■ ■ , h.uW\ ■ Then the MMSE precoding vectors are chosen to 
be the normalized columns of the matrix [31] 




(33) 



*This can also be done in the system with both delay and quantization, which is more time-consuming. As shown in Lemma 
[7] the effects of delay and quantization are equivalent, so the conclusion will be the same. 
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user are fixed. The throughput of MU-MIMO saturates at high SNR due to residual inter-user 
interference, for which a SU/MU mode switching algorithm is proposed. We derive accurate 
closed-form approximations for the ergodic rates for both SU and MU modes, which are then 
used to calculate the mode switching points. It is shown that the MU mode is only possible to 
be active in the medium SNR regime, with a small normalized Doppler frequency and a large 
codebook size. 

For future work, the MU-MIMO mode studied in this paper is designed with zero-forcing 
criterion, which is shown to be sensitive to CSI imperfections, so robust precoding design is 
needed and the impact of the imperfect CSIT on non-linear precoding should be investigated. 
As power control is an effective way to combat interference, it is interesting to consider the 
efficient power control algorithm rather than equal power allocation to improve the performance, 
especially in the heterogeneous scenario. It is also of practical importance to investigate possible 
approaches to improve the quality of the available CSIT with a fixed codebook size, e.g. through 
channel prediction. 



A. Useful Results for Rate Analysis 

In this Appendix, we present some useful results that are used for rate analysis in this paper. 
The following lemma will be used frequently in the derivation of the achievable rate. 

Lemma 2: For a random variable x with probability distribution function (pdf) fx{x) and 
cumulative distribution function (cdf) Fx{x), we have 



where g' is the derivative of the function g, and step (a) follows the integration by parts. ■ 
The following lemma provides some useful integrals for rate analysis, which can be derived 
from the results in [40]. 



Appendix 



Jo ^ 

Proof: The proof follows the integration by parts. 




(34) 




(35) 
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Lemma 3: 

/i(a, b, m) 

hia, 6, m) 



X + b 



dx = j2i^-'^)K-by 



m—k —k 
(Jj 



ir-^b"'e''''Ei{ab) (36) 



k=l 



oo g— ax 



-dx 



l3{a,b,m) 



+ by 

e^'^Eiiab) m = 1 

V^m-l (fc-1)! (-a)'"-*-! I (-a)'"-l ab TP / U\ \ o 

e 



(m-l)! ftfc 
-ax 



_ b)-' . /2 (a, 6, m - z + 1) + (6 - 1)-™ ■ h (a, 1, 1) 



i=l 



where Ei(x) is the exponential-integral function of the first order. 

B. Proof of Theorem [7] 
The average SNR is 



E 
PE 



P 



n\ 



{ph[n - 1] + e[n])*h[?2 - 1] 



(a) 



(fe) 



P|ph*[r2 - l\h.[n - 1]|^ + P|e*[n]h[n - l]p 



< PiVtp^ (^1 - j + PE 

PiVtp' f 1 - 2"^) + Pel, 



(37) 



(38) 



|h*[n - 1] ■ [e[n]e*[n]] ■ h[n - 1]| 



As e[n] is independent of h[n — 1], it is also independent of h.\n — 1], which gives (a). Step (b) 
follows ([n])- Step (c) is from the fact e[n] ~ CA/'(0, e^lArJ and |h[n - 1]^ = 1. 



C. Proof of Theorem |2] 

Denote yi = \\h[n - and y2 = \\e*[n]h[n - 1]^, then yi ~ xItv,. 1/2 ~ xi' and they 
are independent. The received SNR can be written as x = r/i^/i + r]2y2, where % = Pp^ and 
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ri2 = Pcg. The cdf of x is given as [41] 

\ Nt 



\r]2 - % 



+ e 



\ Nt-l i T / \ Nt-l-i / \ i-l 

-^/m ( . y y _J_ f _J1^) f ^) (39) 



Denote qq = rii-m following Lemma \2\ we have 



Ex Nl + X)] 



dx 



l + x 

r°° px/m * Nt-l-i / I roc i-l x/r]i 

' Jo l + x t^t^(^-0!vmy Jo 1 + ^ 

Nt-l i Nt-l-i / I 

a^^h{l/v2, 1, 1) - (1 - ao) E E 7^ " V^, 1,^-0- 



(40) 



1=0 z=o 

where ■, ■) and /2(-, - , ■) are given in (|36l) and (|37l) . respectively. 



D. Proof of Lemma [7] 

Let X = ||h„[n - D]fsm^e ~ r(M - 1,6), y ~ /3(1,M - 2), and x is independent of y. 
Then the interference term due to quantization is z = xy. The cdf of z is 

Pz{z) = P{xy < z) 

""^ (Z\ 

Fy\x y-J fx{x)dx 

fx{x)dx + (^-(l- ^j"""'^ fx{x)dx 
1 - e-^/-^ / ix - z)^'-^-^ ^r^dx 



1 - e-^/^ (41) 
where step (a) follows the equality J^y^e'^'y = Mla'^^^+^l 
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E. Proof of Theorem |?] 

Assuming each interference term in (l27l) is independent of each other and independent of the 
signal power term, denote Y.u'^uPl\K[n " = P^JVi and Y.u'^u KM^^^'^NI' = 

eg„?/2, then from Lemma [7] we have yi ~ X^Nt-iy and y2 ~ X2(Art-i) as e„[r?,] is complex 
Gaussian with variance el „ and independent of the normalized vector f^?^'' [n] . In addition, the 
signal power |h* [n]fl*^^^[n]p ~ xl- Then the received SINR for the w-th user is approximated 
as 

^'^'^ ^ 1 + ^{5,y, + 5,y,) ' ^' ^^^^ 
where a = (3 = ^, 5i = pl6, 62 = e^^, y^ ~ xIm, Vi ~ xLf M = Nt - 1, z r-. xl, and ?/2, 
2; are independent of each other. 

Let y = 5iyi + 52y2, then the pdf of y, which is the sum of two independent chi-square random 
variables, is given as [41] 

M-l M-l 



where 



i=0 i=0 
2 M-l 

= E E (43) 

j=l 1=0 



a'" = 1 ( ^' )" (^)""" (44) 

' 5'+'{M -l)\\Si-Sj i<{M -1-iy. \h-sj ^ ' 

al=' = ' (^) " P'-'^-^-'" (45) 
' 5i*'(M -1)1 \h-Sj i<{M -1-iy. \Si-hJ ' 



The cdf of x is 



Fxix) = P ( < a; 



/■oo 
^0 

poo 

1 - e-'l' / e-f"ylyY{y)dy 



.i=i j=o 



2 M-l . . ^ 



!> dy 
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(46) 



where step (a) follows the equality y^^C'^y = Mla^^^^+i). 



Then the ergodic achievable rate for the w-th user is approximated as 



log2(e)Ex [ln(l + X)] 





(47) 



where step (a) follows from Lemma\2\ step (b) follows the expression of /3(-, - , ■) in (l38l) . 
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TABLE I 
System Parameters 



Symbol 


Description 


Nt 


number of transmit antennas 


U 


number of mobile users 


B 


number of feedback bits 


L 


quantization codebook size, L = 2^ 


F 


average SNR 


n 


time index 


Ts 


the length of each symbol 


fd 


the Doppler frequency 
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TABLE 11 



Mode Switching Points 





fdTs = 0.03 


fdTs = 0.04 


fdTs = 0.05 


MMSE (Simulation) 


44.2 dB 


35.7 dB 


29.5 dB 


ZF (Simulation) 


44.2 dB 


35.4 dB 


28.6 dB 


ZF (Calculation) 


41.6 dB 


32.9 dB 


26.1 dB 




Fig. 1. Approximations and simulations for the ZF system with limited feedback, Nt = U — 4. 




SNR (dB) 



Fig. 2. Comparison of approximation in l |30K the lower bound in ( 1261 ), and the simulation results for the ZF system with both 
delay and channel quantization. B = 10, fc — 2 GHz, v = 20 km/hr, and Ts = 1 msec. 
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Fig. 3. Mode switching between BF and ZF modes with both CSI delay and channel quantization, B = 18, Nt = 4, fc = 2 
GHz, T„ = 1 msec, t; = 10 km/hr. 




Normalized Doppler frequency, f^T^ Codebook size, B 

(a) Different faT,. (b) Different B, fc = 2 GHz, T„ = 1 msec. 



Fig. 4. 



Operating regions for BF and ZF with both CSI delay and quantization, Nt = 4. 
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Codebook size, B 



Fig. 5. Operating regions for BF and ZF with different Nt, /c = 2 GHz, v = 10 km/hr, Ts = 1 msec. 
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SNR (dB) 



Fig. 6. Simulation results for BF, ZF and MMSE systems with delay, Nt = U = 4, faT, = 0.04. 



